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ABSTRACT 


Health has become the utmost priority of people's lives after the whole pandemic situation. Many new 
emerging technologies are striving for better human livelihood. In technology, particularly Machine 
Learning has a great potential in making Human lifestyle better and much simpler. Machine learning has 
extended its arms in Marketing Banking, Smart life, Production and Manufacturing and widely in the health 
care sector.ML has a great impact on the healthcare industry ML is widely used for prediction of Cancer, 
prediction and identifying the damage in kidneys, heart disease predictions etc. By integrating the complex 
algorithms with real-time medical training datasets can produce high accurate results. Hospitals and doctors 
have been extremely busy during the COVID period. There was a time to avoid visiting hospitals during 
these times unless it was absolutely necessary. This application takes the common symptoms of the patient 
and tries to give possible predictions as a primary disease. The application is built by using the most 
famous algorithms in the ML domain like Decision Tree classification algorithm, Random Forest 
classification algorithm, Naive bayes classifier algorithm. This application shows the mechanism of three 
algorithms for disease prediction 
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1. INTRODUCTION the patient can select the symptoms and get the 
predictions just by a click. This was achieved by 
In today’s world, Technology is playing a key role ‘integrating the Machine learning algorithms with 
in the Healthcare industry , with evolving Flask 
technology and changing times things have not 
been the same and in fact have become much 2.LITERATURE REVIEW 
easier. Disease Prediction using Machine learning S Radhika, S Ramiya Shree, V Rukhmani 
is an attempt to do that. This Disease Prediction 
system basically uses the information that is 
provided by the users and comes up with a 
prediction. In broader terms, if any patient is 
feeling ill but they feel it’s not very serious 
momentarily , but they still want to know what they 
could be suffering from , they can just enter the 
symptoms and the Disease prediction system 
provides them with the possible disease they might 
be suffering from. This way, the patient does not 
have to go and consult a doctor even for small 
things and this saves a lot of time too and to most 
extent gives accurate results as well. This system 
can be built solely using Machine Learning 
algorithms using python and in order to make it 
more user friendly we have created a website where 


Divyadharsini, and A Ranjitha proposed and in 
which they considered a Decision tree algorithm for 
detection of diseases. In this they compared 
different algorithms for accuracy and plotted a 
graph and found out that Decision tree has shown 
better accuracy in comparison, which led us to 
choosing Decision Tree algorithm for our model. 


Md. Zahangir Alama, M. Saifur Rahman, M. 
Sohel Rahman, proposed on the prediction of the 
disease using random forest led us to using this 
algorithm. In this paper many datasets were given, 
and each dataset was classified based on diseases, 
so the accuracy was predicted. 

Mostafa Langarizadeh and Fateme Moghbeli 
proposed and in NCBI talked about applying 
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Naive Bayes Algorithm, a ROC curve has been 
plotted for the accuracy comparison the help of 
the symptoms given by the patient. 

Sneha Grampurohit and Chetan Sagarnal 
proposed disease prediction using machine 
learning algorithms has considered basically three 
algorithms for prediction: Naive bayes classifier, 
Random forest, Decision tree algorithm , all of 
which we used in our project. In this sample 
datasets from a standard resource have been 
considered and have been implemented on the 
system we got a clear idea with the help of that. 


3.STAMENT OF THE PROBLEM 


In recent times, with the emergence of Covid and 
various other reasons, the Healthcare industry 
has been one of the busiest industries globally. 
Hospitals and Doctors have been very busy with 
everything that's going on and giving consults to 
easily identifiable diseases is a waste of both time 
and resources . Also, it is not very advisable for 
people to go to hospitals unless it’s absolutely 
necessary as it may lead to unnecessary problems. 
So in order to overcome this problem , it would be 
really helpful if there was a way for the users or 
patients to know what they are suffering from just 
by providing some information about their 
symptoms. That is when we came up with the idea 
of a Disease prediction model through which the 
patient can just enter their symptoms and get 
predictions of what the disease might be. This 
could help a lot of people of all age groups. The 
objective of this project is to use a dataset directly 
retrieved from Kaggle and this is based on past 
hospital datasets, therefore on comparison the 
accuracy must increase. The main objective is to 
use the Machine learning algorithms and build 
models such that we get accurate disease 
predictions according to the symptom. 

Block Diagram of the proposed method 
can be seen as below 


Figure 1: Block Diagram Of Proposed Work 


4. PROBLEM METHODLOGY AND 
SOLUTION 


The Disease Prediction model using Machine 
Learning and Web Integration is implemented 
completely using Python. Flask which is used in 
web integration is also written using python. 
Machine learning gives computers the ability to 
learn. It gives the computers the ability to learn 
without being programmed explicitly 
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Figure 2 Overview Of Machine Learning 


Machine Learning Algorithms used for building 
this model are: 


4.1Decision Tree Algorithm: 

Decision tree is a tree structure which resembles a 
flowchart. 

It is a non- parametric approach which is used for 
building classification models and finding an 
optimal decision tree is a complete problem in the 
NP approach. 

The methods used for the development of 
decision trees are less expensive which makes it 
easier to develop models even when the training 
set size is large. 

Entropy formula for Decision tree algorithm is as 


below: 
ES) = -p logp .- p logp 
(+) (+) “(-) (-) 
And Information gain = E(Y)-E(Y/X) 
—_ ' Decision Tree 
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Figure 3 Example Of Decision Tree 
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4.2 Random Forest: eee thar Yes 

. F . F ung 2 Overcast 4 
Random Forest is a classifier which consists of Sunny No my? 3 

” 4a Overcast Yes - 

multiple decision trees. ie =a ; 
These decision trees operate together as an EE J 
ensemble. CORES NGS Likelihood Table 1 Likelihood Table 2 


Each tree gives a prediction and the one with the 

most votes becomes the final prediction. 

The prerequisites for random forest to perform 

well are: 

There needs to be some actual signal in our 

features so that models built using those features 

do better than random guessing. 

The predictions made by each of the trees must 
have low correlations with each other 


DECISION TREE-1 DECISION TREE-1 DECISION TREE-1 


RESULT-1 RESULT-2 RESULT-N 


— MAJORITY VOTING / AVERAGING __| 


Figure 4 Overview Of Random Forest 


4.3 Naive-Bayes Classifier: 

A group of classification algorithms built on the 
Bayes' Theorem are known as naive Bayes 
classifiers. It is a family of algorithms rather than a 
single method, and they are all based on the idea 
that every pair of features being classified is 
independent of the other. 

A straightforward mathematical procedure for 
computing conditional probabilities is known as 
Bayes' Theorem. 

The probability of an event happening given that 
another event has already occurred (via 
assumption, presumption, statement, or evidence) 
is known as conditional probability. 

The equation is: 


Probability of B occurring 
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Figure 5 Example Of Naive Bayes Classifier 


4.4 Flask: 

A Python package called Flask serves as a web 
framework that makes it simple to create web 
apps. Its core is compact and simple to extend; it's 
a micro framework. 

It does have a lot of great features, including a 
template engine and url routing. It is a web app 
framework for WSGI. 

Provides libraries to build lightweight web 
applications in python. 


isu 


Figure 6. Flask Logo 


5. Testing 

White Box Test cases: 

Check if the environment is set correctly. 

Check if all the codes are integrated and functional. 

If all the algorithms are functioning properly or not. 
Black Box Test cases: 

And in the process of testing, we also tested the 
website and its working quality. After compiling 
the app.py it compiled successfully and rendered 
the home.html on web browser in local host port 
http://127.0.0.1:5000/ 


us website where one can select the symptoms he'she is 
‘ing and accordingly the system shall predict a certain disease 
condition he'she might be suffering from. 


) Figure 7. Home Page 


On selecting the select symptoms it redirects to the 
index.html page where you can select symptoms 
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Disease Predictor 
? Final Predictions 
Select Symptoms 
Alogrithm Prediction 
DECISION TREE hepatitis A 
RANDOM FOREST hepatitis A 


Figure 8 Page To Select Symptoms 


Where we have selected 5 symptoms as the input 
by using dropdown and click select on Predict 
button 


Disease Predictor 


Final Predictions 


Alogrthm Prediction 

DECISION TREE Malena 

RANDOM FOREST Malena 

NAVIE BAYES Fungal infection 
e 


ae 
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Figure 9 Final Predictions 


Here are our final Results, Decision Tree is 
predicting Malaria ,Random Forest is predicting 
Malaria, Naive Bayes is Predicting Fungal 
Infection.After that by clicking home we will 
redirect to the home page. 

Test Cases 

As Testing case-1 1 gave symptoms “mild_ fever”, 
“neck_pain’, “runny_nose’, “cramps, 


“abdominal pain’, as input through dropdown 


boxes 


Select Symptoms 


Figure 9 Page To Select Symptoms 
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Figure 10 Page To Select Symptoms 


All the Algorithms show hepatitis A as the 
prediction in this case and here we assume that the 
patient is suffering from hepatitis A disease and the 
model is working Flawlessly with these inputs. 


8.RESULTS AND DISCUSSIONS 


To evaluate the models, we have a wide range of 
evaluation techniques in the machine learning 
domain. Here in this project, we have used the 
accuracy metric to evaluate all the 3 models. By 
calculating the accuracy, we get the following 
results: 


In [59]: NaiveBayes("aild fever”, "neck_pain”, “runny nose”, "cramps", “abdominal _pain") 


@,9512195121951219 


Jaundice 


Figurel1. Results About Disease 


9. CONCLUSION AND FURTHER SCOPE: 


So, we can conclude that on the basis of the 
symptoms entered by the user in the website we get 
three predictions from the three algorithms and we 
get an overall accuracy of 95.12 percent. Further 
scope of this project is that its features can be 
extended by giving the diagnosis for the prediction 
received, showing the nearby pharmacy etc. Also 
nearest hospitals or clinics which treat the predicted 
disease can be suggested. 


REFERENCES 


[1] Aldahiri, Amani, Bashair Alrashed, and 
Walayat Hussain. "Trends in using iot 
with machine learning in the health 


prediction system." Forecasting 3.1 
(2021): 181-206. 


3590 


Journal of Theoretical and Applied Information Technology 
30" April 2024. Vol.102. No 8 


© Little Lion Scientific 


ISSN: 1992-8645 


[2] 


Ahmad, Muhammad Aurangzeb, Carly 
Eckert, and Ankur Teredesai. 
"Interpretable machine’ learning in 
healthcare." Proceedings of the 2018 
ACM international conference on 
bioinformatics, computational biology, 
and health informatics. 2018. 

Radhika, S., et al. “Symptoms Based 
Disease Prediction Using Decision Tree 
and Electronic Health Record Analysis.” 
European Journal of Molecular Clinical 
Medicine, vol. 7, no. 4, 2020, pp. 2060- 
2067. 

Tu, P-L., and J-Y. Chung. "A new 
decision-tree classification algorithm for 
machine learning." TAI'92- Proceedings 
Fourth International Conference on Tools 
with Artificial Intelligence. JTEEE 
Computer Society,1992. 

Karthika, S., and N. Sairam. "A Naive 
Bayesian classifier for educational 
qualification." Indian Journal of Science 
and Technology 8.16 (2015): 1-5. 


3591 


we 
SY 
rutin 


E-ISSN; 1817-3195 


